Learning and Fusing Multi-View Code Representations for Function Vulnerability Detection

نویسندگان

چکیده

The explosive growth of vulnerabilities poses a significant threat to the security software systems. While various deep-learning-based vulnerability detection methods have emerged, they primarily rely on semantic features extracted from single code representation structure, which limits their ability detect hidden deep within code. To address this limitation, we propose S2FVD, short for Sequence and Structure Fusion-based Vulnerability Detector, fuses vulnerability-indicative learned multiple views more accurate detection. Specifically, S2FVD employs either well-matched or carefully extended neural network models extract token sequence, attributed control flow graph (ACFG) abstract syntax tree (AST) representations function, respectively. These capture different perspectives code, are then fused enable accurately that well-hidden function. experiments conducted two large datasets demonstrated superior performance against state-of-the-art approaches, with its accuracy F1 scores reaching 98.07% 98.14% respectively in detecting presence vulnerabilities, 97.93% 97.94%, respectively, pinpointing specific types. Furthermore, regard real-world dataset D2A, achieved average gains 6.86% 14.84% terms metrics, over baselines. This ablation study also confirms superiority fusing semantics implied distinct further enhance performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-View Learning for Web Spam Detection

Spam pages are designed to maliciously appear among the top search results by excessive usage of popular terms. Therefore, spam pages should be removed using an effective and efficient spam detection system. Previous methods for web spam classification used several features from various information sources (page contents, web graph, access logs, etc.) to detect web spam. In this paper, we follo...

متن کامل

Multi-View Perceptron: a Deep Model for Learning Face Identity and View Representations

Various factors, such as identity, view, and illumination, are coupled in face images. Disentangling the identity and view representations is a major challenge in face recognition. Existing face recognition systems either use handcrafted features or learn features discriminatively to improve recognition accuracy. This is different from the behavior of primate brain. Recent studies [5, 19] disco...

متن کامل

Statistical Learning of Multi-view Face Detection

A new boosting algorithm, called FloatBoost, is proposed to overcome the monotonicity problem of the sequential AdaBoost learning. AdaBoost [1, 2] is a sequential forward search procedure using the greedy selection strategy. The premise oÿered by the sequential procedure can be broken-down when the monotonicity assumption, i.e. that when adding a new feature to the current set, the value of the...

متن کامل

Learning Probabilistic Distribution Model for Multi-View Face Detection

Modeling subspaces of a distribution of interest in high dimensional spaces is a challenging problem in pattern analysis. In this paper, we present a novel framework for pose invariant face detection through multi-view face distribution modeling. The approach is aimed to learn a set of low-dimensional subspaces from an originally nonlinear distribution by using the mixtures of probabilistic PCA...

متن کامل

tight frame approximation for multi-frames and super-frames

در این پایان نامه یک مولد برای چند قاب یا ابر قاب تولید شده تحت عمل نمایش یکانی تصویر برای گروه های شمارش پذیر گسسته بررسی خواهد شد. مثال هایی از این قاب ها چند قاب های گابور، ابرقاب های گابور و قاب هایی برای زیرفضاهای انتقال پایاست. نشان می دهیم که مولد چند قاب تنک نرمال شده (ابرقاب) یکتا وجود دارد به طوری که مینیمم فاصله را از ان دارد. همچنین مسایل مشابه برای قاب های دوگان مطرح شده و برخی ...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12112495